Overview

Dataset info

Number of variables: 16
Number of observations: 374860
Missing cells: 0 (0.0%)
Duplicate rows: 0 (0.0%)
Total size in memory: 45.8 MiB
Average record size in memory: 128.0 B
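
The overview figures can be approximated directly with pandas. The sketch below is a minimal illustration, not the report generator's own code: `dataset_overview` and the toy frame are hypothetical, and the shallow `memory_usage` call is an assumption about how the size was measured (16 columns at 8 bytes each would be consistent with the 128.0 B record size shown above).

```python
import pandas as pd

def dataset_overview(df: pd.DataFrame) -> dict:
    """Summarise a DataFrame roughly the way the overview table does."""
    n_rows, n_cols = df.shape
    n_cells = n_rows * n_cols
    missing = int(df.isna().sum().sum())        # missing cells over all columns
    duplicates = int(df.duplicated().sum())     # rows identical to an earlier row
    memory = int(df.memory_usage(deep=False).sum())  # shallow size in bytes
    return {
        "variables": n_cols,
        "observations": n_rows,
        "missing_cells": missing,
        "missing_pct": 100.0 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "memory_bytes": memory,
        "avg_record_bytes": memory / n_rows if n_rows else 0.0,
    }

# Hypothetical toy frame: one missing cell and one duplicated row.
toy = pd.DataFrame({
    "backers": [0, 15, 15, 3],
    "state": ["failed", "failed", "failed", None],
    "goal": [1000.0, 30000.0, 30000.0, 45000.0],
})
overview = dataset_overview(toy)
```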

Variables types

Numeric: 6
Categorical: 8
Boolean: 0
Date: 0
URL: 0
Text (Unique): 0
Rejected: 2
Unsupported: 0

Warnings

backers is highly skewed (γ1 = 86.34074093) [Skewed]
backers has 51811 (13.8%) zeros [Zeros]
category has a high cardinality: 159 distinct values [Warning]
deadline only contains datetime values, but is categorical. Consider applying pd.to_datetime() [Type]
deadline has a high cardinality: 3164 distinct values [Warning]
goal is highly skewed (γ1 = 70.44444164) [Skewed]
launched only contains datetime values, but is categorical. Consider applying pd.to_datetime() [Type]
launched has a high cardinality: 374298 distinct values [Warning]
name has a high cardinality: 372068 distinct values [Warning]
pledged is highly skewed (γ1 = 74.9629582) [Skewed]
pledged has 51808 (13.8%) zeros [Zeros]
usd_goal_real is highly correlated with goal (ρ = 0.9426905837) [Rejected]
usd_pledged is highly skewed (γ1 = 105.8993653) [Skewed]
usd_pledged has 68111 (18.2%) zeros [Zeros]
usd_pledged_real is highly correlated with usd_pledged (ρ = 0.9077433634) [Rejected]
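
The two Type warnings suggest parsing deadline and launched with pd.to_datetime(). A minimal sketch follows, using two date pairs taken from the sample rows of this report; the explicit format strings and the derived `duration_days` column are illustrative assumptions, not part of the report.

```python
import pandas as pd

# Two rows copied from the report's sample; the rest of the frame is omitted.
df = pd.DataFrame({
    "deadline": ["2015-10-09", "2017-11-01"],
    "launched": ["2015-08-11 12:12:28", "2017-09-02 04:43:57"],
})

# Parse the string columns into datetime64 values.
df["deadline"] = pd.to_datetime(df["deadline"], format="%Y-%m-%d")
df["launched"] = pd.to_datetime(df["launched"], format="%Y-%m-%d %H:%M:%S")

# Hypothetical derived column: whole days between launch and deadline.
df["duration_days"] = (df["deadline"] - df["launched"]).dt.days
```

Once parsed, the columns should no longer be reported as high-cardinality categoricals, and date arithmetic such as the duration above becomes possible.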

Variables

backers
Numeric

Distinct count: 3963
Unique (%): 1.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 106.6883663
Minimum: 0
Maximum: 219382
Zeros (%): 13.8%

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 12
Q3: 57
95-th percentile: 338
Maximum: 219382
Range: 219382
Interquartile range: 55

Descriptive statistics

Standard deviation: 911.7101242
Coef of variation: 8.545543958
Kurtosis: 13818.22311
Mean: 106.6883663
MAD: 147.8618611
Skewness: 86.34074093
Sum: 39993201
Variance: 831215.3506
Memory size: 2.9 MiB

[Histogram with fixed size bins (bins=50)]
[Histogram with variable size bins (bins=[0.00000e+00 5.00000e-01 1.50000e+00 2.50000e+00 3.50000e+00 ... 1.28190e+04 1.85810e+04 3.68220e+04 8.93635e+04 2.19382e+05], "bayesian blocks" binning strategy used)]

Value | Count | Frequency (%)
0 | 51811 | 13.8%
1 | 34868 | 9.3%
2 | 23196 | 6.2%
3 | 16063 | 4.3%
4 | 12068 | 3.2%
5 | 9715 | 2.6%
6 | 8137 | 2.2%
7 | 7014 | 1.9%
8 | 6198 | 1.7%
9 | 5553 | 1.5%
Other values (3953) | 200237 | 53.4%

Minimum 5 values

Value | Count | Frequency (%)
0 | 51811 | 13.8%
1 | 34868 | 9.3%
2 | 23196 | 6.2%
3 | 16063 | 4.3%
4 | 12068 | 3.2%

Maximum 5 values

Value | Count | Frequency (%)
219382 | 1 | < 0.1%
154926 | 1 | < 0.1%
105857 | 1 | < 0.1%
91585 | 1 | < 0.1%
87142 | 1 | < 0.1%
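
A skewness of 86.3 with a median of 12 and a maximum of 219382 points to a very long right tail. One common way to tame such a variable before modelling is a log(1 + x) transform, which also handles the 13.8% zeros. The sketch below is only an illustration: the sample values are hypothetical, loosely shaped like the quantiles above, and are not rows from the dataset.

```python
import numpy as np
import pandas as pd

# Hypothetical sample shaped like the backers quantiles above (not real rows).
backers = pd.Series([0, 0, 1, 2, 3, 12, 57, 338, 219382], dtype="float64")

raw_skew = backers.skew()          # sample skewness of the raw counts
log_backers = np.log1p(backers)    # log(1 + x) maps zero backers to 0.0
log_skew = log_backers.skew()      # skewness after the transform

print(f"raw skew: {raw_skew:.2f}, log1p skew: {log_skew:.2f}")
```

The same transform would apply to goal, pledged and usd_pledged, which carry the same Skewed warning.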
 

category
Categorical

Distinct count: 159
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Top values:
Product Design | 22310
Documentary | 16138
Tabletop Games | 14178
Other values (156) | 322234

Value | Count | Frequency (%)
Product Design | 22310 | 6.0%
Documentary | 16138 | 4.3%
Tabletop Games | 14178 | 3.8%
Music | 13340 | 3.6%
Shorts | 12357 | 3.3%
Video Games | 11828 | 3.2%
Food | 11492 | 3.1%
Film & Video | 9224 | 2.5%
Fiction | 9168 | 2.4%
Fashion | 8554 | 2.3%
Other values (149) | 246271 | 65.7%

Max length: 18
Mean length: 9.062695406
Min length: 3
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

country
Categorical

Distinct count: 22
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Top values:
US | 292624
GB | 33671
CA | 14756
Other values (19) | 33809

Value | Count | Frequency (%)
US | 292624 | 78.1%
GB | 33671 | 9.0%
CA | 14756 | 3.9%
AU | 7839 | 2.1%
DE | 4171 | 1.1%
FR | 2939 | 0.8%
IT | 2878 | 0.8%
NL | 2868 | 0.8%
ES | 2276 | 0.6%
SE | 1757 | 0.5%
Other values (12) | 9081 | 2.4%

Max length: 2
Mean length: 2
Min length: 2
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False
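
The frequency tables in this report list the ten most common values and fold the remainder into an "Other values (n)" row. A rough way to rebuild such a table with pandas is sketched below; `frequency_table` is a hypothetical helper and the toy series is invented, so this is an approximation of the layout rather than the report's own code.

```python
import pandas as pd

def frequency_table(values: pd.Series, top: int = 10) -> pd.DataFrame:
    """Top-N value counts plus one aggregated 'Other values (n)' row."""
    counts = values.value_counts()                 # sorted, most frequent first
    head, tail = counts.iloc[:top], counts.iloc[top:]
    rows = [(str(value), int(count)) for value, count in head.items()]
    if len(tail) > 0:
        rows.append((f"Other values ({len(tail)})", int(tail.sum())))
    table = pd.DataFrame(rows, columns=["value", "count"])
    # Share of all observations, rounded to one decimal as in the report.
    table["frequency_pct"] = (100.0 * table["count"] / len(values)).round(1)
    return table

# Hypothetical toy column; top=2 keeps the example short.
toy = pd.Series(["US"] * 6 + ["GB"] * 2 + ["CA", "AU"])
table = frequency_table(toy, top=2)
print(table)
```

As a check against the table above, 292624 / 374860 is about 78.1%, which matches the US row.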

currency
Categorical

Distinct count: 14
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Top values:
USD | 292624
GBP | 33671
EUR | 17219
Other values (11) | 31346

Value | Count | Frequency (%)
USD | 292624 | 78.1%
GBP | 33671 | 9.0%
EUR | 17219 | 4.6%
CAD | 14756 | 3.9%
AUD | 7839 | 2.1%
SEK | 1757 | 0.5%
MXN | 1752 | 0.5%
NZD | 1447 | 0.4%
DKK | 1113 | 0.3%
CHF | 761 | 0.2%
Other values (4) | 1921 | 0.5%

Max length: 3
Mean length: 3
Min length: 3
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

deadline
Categorical

Distinct count: 3164
Unique (%): 0.8%
Missing (%): 0.0%
Missing (n): 0

Top values:
2014-08-08 | 702
2014-08-10 | 556
2014-08-07 | 541
Other values (3161) | 373061

Value | Count | Frequency (%)
2014-08-08 | 702 | 0.2%
2014-08-10 | 556 | 0.1%
2014-08-07 | 541 | 0.1%
2014-08-09 | 473 | 0.1%
2015-05-01 | 463 | 0.1%
2015-07-01 | 437 | 0.1%
2014-08-15 | 421 | 0.1%
2015-04-01 | 420 | 0.1%
2014-08-14 | 411 | 0.1%
2014-08-31 | 411 | 0.1%
Other values (3154) | 370025 | 98.7%

Max length: 10
Mean length: 10
Min length: 10
Contains chars: False
Contains digits: True
Contains spaces: False
Contains non-words: True

df_index
Numeric

Distinct count: 374860
Unique (%): 100.0%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 189316.3371
Minimum: 0
Maximum: 378660
Zeros (%): < 0.1%

Quantile statistics

Minimum: 0
5-th percentile: 18947.95
Q1: 94645.75
Median: 189315.5
Q3: 283972.25
95-th percentile: 359724.05
Maximum: 378660
Range: 378660
Interquartile range: 189326.5

Descriptive statistics

Standard deviation: 109311.5669
Coef of variation: 0.5774016579
Kurtosis: -1.200108674
Mean: 189316.3371
MAD: 94667.33873
Skewness: 0.000113888787
Sum: 7.096712214e+10
Variance: 1.194901867e+10
Memory size: 2.9 MiB

[Histogram with fixed size bins (bins=50)]
[Histogram with variable size bins (bins=[ 0. 378660.], "bayesian blocks" binning strategy used)]

Value | Count | Frequency (%)
2047 | 1 | < 0.1%
72310 | 1 | < 0.1%
92792 | 1 | < 0.1%
90745 | 1 | < 0.1%
96890 | 1 | < 0.1%
84604 | 1 | < 0.1%
82557 | 1 | < 0.1%
88702 | 1 | < 0.1%
86655 | 1 | < 0.1%
305856 | 1 | < 0.1%
Other values (374850) | 374850 | > 99.9%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | < 0.1%
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
378660 | 1 | < 0.1%
378659 | 1 | < 0.1%
378658 | 1 | < 0.1%
378657 | 1 | < 0.1%
378656 | 1 | < 0.1%
 

goal
Numeric

Distinct count: 8312
Unique (%): 2.2%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 49522.98884
Minimum: 0.01
Maximum: 100000000
Zeros (%): 0.0%

Quantile statistics

Minimum: 0.01
5-th percentile: 400
Q1: 2000
Median: 5500
Q3: 16500
95-th percentile: 90000
Maximum: 100000000
Range: 99999999.99
Interquartile range: 14500

Descriptive statistics

Standard deviation: 1189361.601
Coef of variation: 24.0163534
Kurtosis: 5519.119935
Mean: 49522.98884
MAD: 73440.87686
Skewness: 70.44444164
Sum: 1.85641876e+10
Variance: 1.414581018e+12
Memory size: 2.9 MiB

[Histogram with fixed size bins (bins=50)]
[Histogram with variable size bins (bins=[1.0000000e-02 7.5000000e-01 1.4250000e+00 4.5000000e+00 5.5000000e+00 ... 1.0029933e+07 2.5500000e+07 5.6500000e+07 9.9500000e+07 1.0000000e+08], "bayesian blocks" binning strategy used)]

Value | Count | Frequency (%)
5000 | 29180 | 7.8%
10000 | 25980 | 6.9%
1000 | 16876 | 4.5%
3000 | 15348 | 4.1%
2000 | 14903 | 4.0%
15000 | 14222 | 3.8%
20000 | 13092 | 3.5%
500 | 11597 | 3.1%
2500 | 11589 | 3.1%
25000 | 10365 | 2.8%
Other values (8302) | 211708 | 56.5%

Minimum 5 values

Value | Count | Frequency (%)
0.01 | 2 | < 0.1%
0.15 | 1 | < 0.1%
0.5 | 1 | < 0.1%
1 | 430 | 0.1%
1.85 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
100000000 | 36 | < 0.1%
99000000 | 2 | < 0.1%
80000000 | 2 | < 0.1%
75000000 | 1 | < 0.1%
73000000 | 1 | < 0.1%
 

ID
Numeric

Distinct count: 374860
Unique (%): 100.0%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 1074652356
Minimum: 5971
Maximum: 2147476221
Zeros (%): 0.0%

Quantile statistics

Minimum: 5971
5-th percentile: 108665913.6
Q1: 538063323
Median: 1075289160
Q3: 1610137351
95-th percentile: 2039710999
Maximum: 2147476221
Range: 2147470250
Interquartile range: 1072074028

Descriptive statistics

Standard deviation: 619136772.6
Coef of variation: 0.5761274978
Kurtosis: -1.198059604
Mean: 1074652356
MAD: 536084923.8
Skewness: -0.002572662134
Sum: 4.02844182e+14
Variance: 3.833303432e+17
Memory size: 2.9 MiB

[Histogram with fixed size bins (bins=50)]
[Histogram with variable size bins (bins=[5.97100000e+03 2.14747622e+09], "bayesian blocks" binning strategy used)]

Value | Count | Frequency (%)
1294469119 | 1 | < 0.1%
19964225 | 1 | < 0.1%
1371058499 | 1 | < 0.1%
2010155332 | 1 | < 0.1%
400589125 | 1 | < 0.1%
794451597 | 1 | < 0.1%
487139804 | 1 | < 0.1%
292614771 | 1 | < 0.1%
644411723 | 1 | < 0.1%
486666825 | 1 | < 0.1%
Other values (374850) | 374850 | > 99.9%

Minimum 5 values

Value | Count | Frequency (%)
5971 | 1 | < 0.1%
18520 | 1 | < 0.1%
21109 | 1 | < 0.1%
21371 | 1 | < 0.1%
24380 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
2147476221 | 1 | < 0.1%
2147472329 | 1 | < 0.1%
2147466649 | 1 | < 0.1%
2147460119 | 1 | < 0.1%
2147455254 | 1 | < 0.1%
 

launched
Categorical

Distinct count: 374298
Unique (%): 99.9%
Missing (%): 0.0%
Missing (n): 0

Top values:
1970-01-01 01:00:00 | 7
2014-07-08 19:55:03 | 2
2017-03-20 20:11:00 | 2
Other values (374295) | 374849

Value | Count | Frequency (%)
1970-01-01 01:00:00 | 7 | < 0.1%
2014-07-08 19:55:03 | 2 | < 0.1%
2017-03-20 20:11:00 | 2 | < 0.1%
2014-11-11 22:00:42 | 2 | < 0.1%
2014-07-10 16:00:06 | 2 | < 0.1%
2017-08-07 18:49:57 | 2 | < 0.1%
2017-02-07 22:00:01 | 2 | < 0.1%
2017-01-31 18:06:47 | 2 | < 0.1%
2015-02-24 00:32:16 | 2 | < 0.1%
2015-02-20 03:29:57 | 2 | < 0.1%
Other values (374288) | 374835 | > 99.9%

Max length: 19
Mean length: 19
Min length: 19
Contains chars: False
Contains digits: True
Contains spaces: True
Contains non-words: True
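
The most frequent launched value, 1970-01-01 01:00:00 (7 rows), sits at the start of the Unix epoch. That may indicate a placeholder for an unknown launch time; this is an assumption, since the report itself does not say so. A minimal sketch for flagging such rows after parsing, using three values from the table above:

```python
import pandas as pd

# Three launched values copied from the frequency table above.
launched = pd.to_datetime(pd.Series([
    "1970-01-01 01:00:00",
    "2014-07-08 19:55:03",
    "2017-03-20 20:11:00",
]))

# Assumption: any launch in 1970 is a placeholder rather than a real date.
is_placeholder = launched.dt.year == 1970
clean = launched[~is_placeholder]
```

Whether to drop or impute the flagged rows depends on the analysis; the filter above only separates them.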

main_category
Categorical

Distinct count: 15
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Top values:
Film & Video | 62696
Music | 49530
Publishing | 39379
Other values (12) | 223255

Value | Count | Frequency (%)
Film & Video | 62696 | 16.7%
Music | 49530 | 13.2%
Publishing | 39379 | 10.5%
Games | 35225 | 9.4%
Technology | 32562 | 8.7%
Design | 30066 | 8.0%
Art | 28152 | 7.5%
Food | 24599 | 6.6%
Fashion | 22812 | 6.1%
Theater | 10912 | 2.9%
Other values (5) | 38927 | 10.4%

Max length: 12
Mean length: 7.462930161
Min length: 3
Contains chars: True
Contains digits: False
Contains spaces: True
Contains non-words: True

name
Categorical

Distinct count: 372068
Unique (%): 99.3%
Missing (%): 0.0%
Missing (n): 0

Top values:
New EP/Music Development | 13
Canceled (Canceled) | 13
Music Video | 11
Other values (372065) | 374823

Value | Count | Frequency (%)
New EP/Music Development | 13 | < 0.1%
Canceled (Canceled) | 13 | < 0.1%
Music Video | 11 | < 0.1%
N/A (Canceled) | 11 | < 0.1%
New EP / Music Development | 10 | < 0.1%
Cancelled (Canceled) | 10 | < 0.1%
Debut Album | 9 | < 0.1%
Reflections | 9 | < 0.1%
The Journey | 9 | < 0.1%
The Other Side | 8 | < 0.1%
Other values (372058) | 374757 | > 99.9%

Max length: 96
Mean length: 34.86675292
Min length: 1
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

pledged
Numeric

Distinct count: 61936
Unique (%): 16.5%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 9750.538301
Minimum: 0
Maximum: 20338986.27
Zeros (%): 13.8%

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 31
Median: 620
Q3: 4080
95-th percentile: 30010.05
Maximum: 20338986.27
Range: 20338986.27
Interquartile range: 4049

Descriptive statistics

Standard deviation: 96010.93751
Coef of variation: 9.846732
Kurtosis: 9952.710565
Mean: 9750.538301
MAD: 14311.86115
Skewness: 74.9629582
Sum: 3655086787
Variance: 9218100121
Memory size: 2.9 MiB

[Histogram with fixed size bins (bins=50)]
[Histogram with variable size bins (bins=[0.00000000e+00 5.00000000e-01 1.00500000e+00 1.24000000e+00 1.99500000e+00 ... 1.12389844e+06 1.73035349e+06 3.41529830e+06 6.81926975e+06 2.03389863e+07], "bayesian blocks" binning strategy used)]

Value | Count | Frequency (%)
0 | 51808 | 13.8%
1 | 9023 | 2.4%
10 | 4982 | 1.3%
25 | 3958 | 1.1%
50 | 3594 | 1.0%
5 | 3577 | 1.0%
20 | 3196 | 0.9%
100 | 3030 | 0.8%
2 | 2399 | 0.6%
30 | 2110 | 0.6%
Other values (61926) | 287183 | 76.6%

Minimum 5 values

Value | Count | Frequency (%)
0 | 51808 | 13.8%
1 | 9023 | 2.4%
1.01 | 5 | < 0.1%
1.02 | 3 | < 0.1%
1.03 | 3 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
20338986.27 | 1 | < 0.1%
13285226.36 | 1 | < 0.1%
12779843.49 | 1 | < 0.1%
12393139.69 | 1 | < 0.1%
10266845.74 | 1 | < 0.1%
 

state
Categorical

Distinct count: 5
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Top values:
failed | 197611
successful | 133851
canceled | 38757
Other values (2) | 4641

Value | Count | Frequency (%)
failed | 197611 | 52.7%
successful | 133851 | 35.7%
canceled | 38757 | 10.3%
live | 2798 | 0.7%
suspended | 1843 | 0.5%

Max length: 10
Mean length: 7.634879688
Min length: 4
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

usd_goal_real
Highly correlated

This variable is highly correlated with goal and should be ignored for analysis

Correlation: 0.9426905837
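
A rejection like this one can be reproduced with a pairwise Pearson correlation check. The sketch below is an approximation under stated assumptions: `reject_correlated` is a hypothetical helper, the 0.9 threshold is assumed (both rejected variables in this report have ρ above 0.9), and the toy values are taken from the first five sample rows.

```python
import pandas as pd

def reject_correlated(df: pd.DataFrame, threshold: float = 0.9):
    """Keep columns in order; reject any column whose absolute Pearson
    correlation with an already kept column exceeds the threshold."""
    corr = df.corr(method="pearson").abs()
    kept, rejected = [], {}
    for col in corr.columns:
        partner = next((k for k in kept if corr.loc[k, col] > threshold), None)
        if partner is None:
            kept.append(col)
        else:
            rejected[col] = partner   # remember which column it duplicates
    return kept, rejected

# goal, usd_goal_real and backers from the first five sample rows.
toy = pd.DataFrame({
    "goal": [1000.0, 30000.0, 45000.0, 5000.0, 19500.0],
    "usd_goal_real": [1533.95, 30000.0, 45000.0, 5000.0, 19500.0],
    "backers": [0, 15, 3, 1, 14],
})
kept, rejected = reject_correlated(toy)
print(kept, rejected)
```

Because columns are visited in order, the variable that appears first is kept, which mirrors usd_goal_real being rejected in favour of goal. A high linear correlation does not make the two columns interchangeable: usd_goal_real is converted to USD, so dropping it may lose information for non-USD projects.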

usd_pledged
Numeric

Distinct count: 95454
Unique (%): 25.5%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 7036.802252
Minimum: 0
Maximum: 20338986.27
Zeros (%): 18.2%

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 16.98
Median: 394.765
Q3: 3034.425
95-th percentile: 22433.0475
Maximum: 20338986.27
Range: 20338986.27
Interquartile range: 3017.445

Descriptive statistics

Standard deviation: 78640.16167
Coef of variation: 11.17555373
Kurtosis: 18960.72249
Mean: 7036.802252
MAD: 10320.83182
Skewness: 105.8993653
Sum: 2637815692
Variance: 6184275027
Memory size: 2.9 MiB

[Histogram with fixed size bins (bins=50)]
[Histogram with variable size bins (bins=[0.00000000e+00 2.35000000e-01 5.55000000e-01 7.05000000e-01 7.45000000e-01 ... 9.43430540e+05 1.64513623e+06 3.41529830e+06 6.27932538e+06 2.03389863e+07], "bayesian blocks" binning strategy used)]

Value | Count | Frequency (%)
0 | 68111 | 18.2%
1 | 5341 | 1.4%
25 | 3877 | 1.0%
10 | 3624 | 1.0%
50 | 3141 | 0.8%
100 | 2673 | 0.7%
5 | 2598 | 0.7%
20 | 2473 | 0.7%
30 | 1717 | 0.5%
2 | 1451 | 0.4%
Other values (95444) | 279854 | 74.7%

Minimum 5 values

Value | Count | Frequency (%)
0 | 68111 | 18.2%
0.47 | 3 | < 0.1%
0.48 | 1 | < 0.1%
0.51 | 1 | < 0.1%
0.52 | 3 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
20338986.27 | 1 | < 0.1%
13285226.36 | 1 | < 0.1%
12779843.49 | 1 | < 0.1%
10266845.74 | 1 | < 0.1%
9192055.66 | 1 | < 0.1%
 

usd_pledged_real
Highly correlated

This variable is highly correlated with usd_pledged and should be ignored for analysis

Correlation: 0.9077433634

Correlations

Missing values

Sample

First rows

(index) | backers | category | country | currency | deadline | df_index | goal | ID | launched | main_category | name | pledged | state | usd_goal_real | usd_pledged | usd_pledged_real
0 | 0 | Poetry | GB | GBP | 2015-10-09 | 0 | 1000.0 | 1000002330 | 2015-08-11 12:12:28 | Publishing | The Songs of Adelaide & Abullah | 0.00 | failed | 1533.95 | 0.00 | 0.00
1 | 15 | Narrative Film | US | USD | 2017-11-01 | 1 | 30000.0 | 1000003930 | 2017-09-02 04:43:57 | Film & Video | Greeting From Earth: ZGAC Arts Capsule For ET | 2421.00 | failed | 30000.00 | 100.00 | 2421.00
2 | 3 | Narrative Film | US | USD | 2013-02-26 | 2 | 45000.0 | 1000004038 | 2013-01-12 00:20:50 | Film & Video | Where is Hank? | 220.00 | failed | 45000.00 | 220.00 | 220.00
3 | 1 | Music | US | USD | 2012-04-16 | 3 | 5000.0 | 1000007540 | 2012-03-17 03:24:11 | Music | ToshiCapital Rekordz Needs Help to Complete Album | 1.00 | failed | 5000.00 | 1.00 | 1.00
4 | 14 | Film & Video | US | USD | 2015-08-29 | 4 | 19500.0 | 1000011046 | 2015-07-04 08:35:03 | Film & Video | Community Film Project: The Art of Neighborhoo... | 1283.00 | canceled | 19500.00 | 1283.00 | 1283.00
5 | 224 | Restaurants | US | USD | 2016-04-01 | 5 | 50000.0 | 1000014025 | 2016-02-26 13:38:27 | Food | Monarch Espresso Bar | 52375.00 | successful | 50000.00 | 52375.00 | 52375.00
6 | 16 | Food | US | USD | 2014-12-21 | 6 | 1000.0 | 1000023410 | 2014-12-01 18:30:44 | Food | Support Solar Roasted Coffee & Green Energy! ... | 1205.00 | successful | 1000.00 | 1205.00 | 1205.00
7 | 40 | Drinks | US | USD | 2016-03-17 | 7 | 25000.0 | 1000030581 | 2016-02-01 20:05:12 | Food | Chaser Strips. Our Strips make Shots their B*tch! | 453.00 | failed | 25000.00 | 453.00 | 453.00
8 | 58 | Product Design | US | USD | 2014-05-29 | 8 | 125000.0 | 1000034518 | 2014-04-24 18:14:43 | Design | SPIN - Premium Retractable In-Ear Headphones w... | 8233.00 | canceled | 125000.00 | 8233.00 | 8233.00
9 | 43 | Documentary | US | USD | 2014-08-10 | 9 | 65000.0 | 100004195 | 2014-07-11 21:55:48 | Film & Video | STUDIO IN THE SKY - A Documentary Feature Film... | 6240.57 | canceled | 65000.00 | 6240.57 | 6240.57

Last rows

(index) | backers | category | country | currency | deadline | df_index | goal | ID | launched | main_category | name | pledged | state | usd_goal_real | usd_pledged | usd_pledged_real
374850 | 78 | Classical Music | CA | CAD | 2014-03-22 | 378651 | 5000.0 | 999969812 | 2014-02-20 01:00:16 | Music | AT THE BEACH | 5501.0 | successful | 4529.81 | 5019.92 | 4983.69
374851 | 36 | Documentary | NO | NOK | 2015-04-28 | 378652 | 20000.0 | 999971898 | 2015-03-29 21:30:33 | Film & Video | Beach Wrestling Documentary | 21500.0 | successful | 2675.19 | 2698.97 | 2875.83
374852 | 1 | Documentary | US | USD | 2012-03-16 | 378653 | 1700.0 | 999972264 | 2012-02-15 04:31:10 | Film & Video | Islanda | 25.0 | failed | 1700.00 | 25.00 | 25.00
374853 | 4 | Small Batch | US | USD | 2017-04-19 | 378654 | 6500.0 | 999975836 | 2017-03-20 22:08:22 | Food | Homemade fresh dog food, Cleveland OH | 154.0 | failed | 6500.00 | 0.00 | 154.00
374854 | 0 | Poetry | CA | CAD | 2014-09-20 | 378655 | 5500.0 | 999976312 | 2014-08-06 03:46:07 | Publishing | Angela's Poetry (Canceled) | 0.0 | canceled | 4949.60 | 0.00 | 0.00
374855 | 1 | Documentary | US | USD | 2014-10-17 | 378656 | 50000.0 | 999976400 | 2014-09-17 02:35:30 | Film & Video | ChknTruk Nationwide Charity Drive 2014 (Canceled) | 25.0 | canceled | 50000.00 | 25.00 | 25.00
374856 | 5 | Narrative Film | US | USD | 2011-07-19 | 378657 | 1500.0 | 999977640 | 2011-06-22 03:35:14 | Film & Video | The Tribe | 155.0 | failed | 1500.00 | 155.00 | 155.00
374857 | 1 | Narrative Film | US | USD | 2010-08-16 | 378658 | 15000.0 | 999986353 | 2010-07-01 19:40:30 | Film & Video | Walls of Remedy- New lesbian Romantic Comedy f... | 20.0 | failed | 15000.00 | 20.00 | 20.00
374858 | 6 | Technology | US | USD | 2016-02-13 | 378659 | 15000.0 | 999987933 | 2016-01-13 18:13:53 | Technology | BioDefense Education Kit | 200.0 | failed | 15000.00 | 200.00 | 200.00
374859 | 17 | Performance Art | US | USD | 2011-08-16 | 378660 | 2000.0 | 999988282 | 2011-07-19 09:07:47 | Art | Nou Renmen Ayiti! We Love Haiti! | 524.0 | failed | 2000.00 | 524.00 | 524.00